253 research outputs found

    ALBERTI, a Multilingual Domain Specific Language Model for Poetry Analysis

    The computational analysis of poetry is limited by the scarcity of tools to automatically analyze and scan poems. In a multilingual setting, the problem is exacerbated because scansion and rhyme systems only exist for individual languages, making comparative studies very challenging and time-consuming. In this work, we present Alberti, the first multilingual pre-trained large language model for poetry. Through domain-specific pre-training (DSP), we further trained multilingual BERT on a corpus of over 12 million verses from 12 languages. We evaluated its performance on two structural poetry tasks: Spanish stanza type classification, and metrical pattern prediction for Spanish, English and German. In both cases, Alberti outperforms multilingual BERT and other transformer-based models of similar sizes, and even achieves state-of-the-art results for German when compared to rule-based systems, demonstrating the feasibility and effectiveness of DSP in the poetry domain. Comment: Accepted for publication at SEPLN 2023: 39th International Conference of the Spanish Society for Natural Language Processing.

    A decomposition-based uncertainty quantification approach for environmental impacts of aviation technology and operation

    As a measure to manage the climate impact of aviation, significant enhancements to aviation technologies and operations are necessary. When assessing these enhancements and their respective impacts on the climate, it is important that we also quantify the associated uncertainties. This is important to support an effective decision-making and policymaking process. However, such quantification of uncertainty is challenging, especially in a complex system that comprises multiple interacting components. The uncertainty quantification task can quickly become computationally intractable and cumbersome for one individual or group to manage. Recognizing the challenge of quantifying uncertainty in multicomponent systems, we utilize a divide-and-conquer approach, inspired by the decomposition-based approaches used in multidisciplinary analysis and optimization. Specifically, we perform uncertainty analysis and global sensitivity analysis of our multicomponent aviation system in a decomposition-based manner. In this work, we demonstrate how to handle a high-dimensional multicomponent interface using sensitivity-based dimension reduction and a novel importance sampling method. Our results demonstrate that the decomposition-based uncertainty quantification approach can effectively quantify the uncertainty of a feed-forward multicomponent system for which the component models are housed in different locations and owned by different groups. Keywords: Aviation Environmental Impact; Decomposition; Global Sensitivity Analysis; Uncertainty Quantification

    Píldoras Educativas para la Elaboración del Trabajo Final de Grado en Estudios Ingleses (Lengua y Lingüística).

    This project proposes developing a module, structured as educational pills (short instructional videos), to give final-year undergraduate students an overview of how to approach writing the final degree project (TFG).

    A genome-wide association study follow-up suggests a possible role for PPARG in systemic sclerosis susceptibility

    Introduction: A recent genome-wide association study (GWAS) comprising a French cohort of systemic sclerosis (SSc) reported several non-HLA single-nucleotide polymorphisms (SNPs) showing a nominal association in the discovery phase. We aimed to identify previously overlooked susceptibility variants by using a follow-up strategy. Methods: Sixty-six non-HLA SNPs showing a P value < 10⁻⁴ in the discovery phase of the French SSc GWAS were analyzed in the first step of this study, performing a meta-analysis that combined data from the two published SSc GWASs. A total of 2,921 SSc patients and 6,963 healthy controls were included in this first phase. Two SNPs, PPARG rs310746 and CHRNA9 rs6832151, were selected for genotyping in the replication cohort (1,068 SSc patients and 6,762 healthy controls) based on the results of the first step. Genotyping was performed by using TaqMan SNP genotyping assays. Results: We observed nominal associations for both PPARG rs310746 (P_MH = 1.90 × 10⁻⁶; OR, 1.28) and CHRNA9 rs6832151 (P_MH = 4.30 × 10⁻⁶; OR, 1.17) genetic variants with SSc in the first step of our study. In the replication phase, we observed a trend of association for PPARG rs310746 (P value = 0.066; OR, 1.17). The combined overall Mantel-Haenszel meta-analysis of all the cohorts included in the present study revealed that PPARG rs310746 remained associated with SSc with a nominal non-genome-wide significant P value (P_MH = 5.00 × 10⁻⁷; OR, 1.25). No evidence of association was observed for CHRNA9 rs6832151 either in the replication phase or in the overall pooled analysis. Conclusion: Our results suggest a role of the PPARG gene in the development of SSc.

    Cross-disease Meta-analysis of Genome-wide Association Studies for Systemic Sclerosis and Rheumatoid Arthritis Reveals IRF4 as a New Common Susceptibility Locus

    Objectives: Systemic sclerosis (SSc) and rheumatoid arthritis (RA) are autoimmune diseases that share clinical and immunological characteristics. To date, several shared SSc-RA loci have been identified independently. In this study, we aimed to systematically search for new common SSc-RA loci through an inter-disease meta-GWAS strategy. Methods: We performed a meta-analysis combining GWAS datasets of SSc and RA using a strategy that allowed identification of loci with both same-direction and opposing-direction allelic effects. The top single-nucleotide polymorphisms (SNPs) were followed up in independent SSc and RA case-control cohorts. This allowed us to increase the sample size to a total of 8,830 SSc patients, 16,870 RA patients and 43,393 controls. Results: The cross-disease meta-analysis of the GWAS datasets identified several loci with nominal association signals (P value < 5 × 10⁻⁶), which also showed evidence of association in the disease-specific GWAS scan. These loci included several genomic regions not previously reported as shared loci, besides risk factors associated with both diseases in previous studies. The follow-up of the putatively new SSc-RA loci identified IRF4 as a shared risk factor for these two diseases (P_combined = 3.29 × 10⁻¹²). In addition, the analysis of the biological relevance of the known SSc-RA shared loci pointed to the type I interferon and the interleukin 12 signaling pathways as the main common etiopathogenic factors. Conclusions: Our study has identified a novel shared locus, IRF4, for SSc and RA and highlighted the usefulness of cross-disease GWAS meta-analysis in the identification of common risk loci.

    Overview of recent TJ-II stellarator results

    The main results obtained in the TJ-II stellarator in the last two years are reported. The most important topics investigated have been modelling and validation of impurity transport, validation of gyrokinetic simulations, turbulence characterisation, effect of magnetic configuration on transport, fuelling with pellet injection, fast particles and liquid metal plasma facing components. As regards impurity transport research, a number of working lines exploring several recently discovered effects have been developed: the effect of tangential drifts on stellarator neoclassical transport, the impurity flux driven by electric fields tangent to magnetic surfaces and attempts at experimental validation with Doppler reflectometry of the variation of the radial electric field on the flux surface. Concerning gyrokinetic simulations, two validation activities have been performed: the comparison with measurements of zonal flow relaxation in pellet-induced fast transients and the comparison with the experimental poloidal variation of fluctuation amplitude. The impact of radial electric fields on turbulence spreading in the edge and scrape-off layer has also been experimentally characterized using a 2D Langmuir probe array. Another remarkable piece of work has been the investigation of the radial propagation of small temperature perturbations using transfer entropy. Research on the physics and modelling of plasma core fuelling with pellet and tracer-encapsulated solid-pellet injection has also produced relevant results. Neutral beam injection driven Alfvénic activity and its possible control by electron cyclotron current drive have been examined as well in TJ-II. Finally, recent results on alternative plasma facing components based on liquid metals are also presented.
    This work has been carried out within the framework of the EUROfusion Consortium and has received funding from the Euratom research and training programme 2014–2018 under Grant Agreement No. 633053. It has been partially funded by the Ministerio de Ciencia, Innovación y Universidades of Spain under projects ENE2013-48109-P, ENE2015-70142-P and FIS2017-88892-P. It has also received funds from the Spanish Government via mobility grant PRX17/00425. The authors thankfully acknowledge the computer resources at MareNostrum and the technical support provided by the Barcelona S.C. It has been supported as well by The Science and Technology Center in Ukraine (STCU), Project P-507F.

    Assessing associations between the AURKA-HMMR-TPX2-TUBG1 functional module and breast cancer risk in BRCA1/2 mutation carriers

    While interplay between BRCA1 and AURKA-RHAMM-TPX2-TUBG1 regulates mammary epithelial polarization, common genetic variation in HMMR (gene product RHAMM) may be associated with risk of breast cancer in BRCA1 mutation carriers. Following on these observations, we further assessed the link between the AURKA-HMMR-TPX2-TUBG1 functional module and risk of breast cancer in BRCA1 or BRCA2 mutation carriers. Forty-one single nucleotide polymorphisms (SNPs) were genotyped in 15,252 BRCA1 and 8,211 BRCA2 mutation carriers and subsequently analyzed using a retrospective likelihood appr